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A number of abundant mobile genetic elements called 
retrotransposons reverse transcribe RNA to generate DNA 
for insertion into eukaryotic genomes. Non-long-terminal 
repeat (non-LTR) retrotransposons represent a major class of 
retrotransposons, and transposons that move by target-primed 
reverse transcription lack LTRs characteristic of retroviruses 
and retroviral-like transposons. Yeast model systems in 
Candida albicans and Saccharomyces cerevisiae have been 
developed for the study of non-LTR retrotransposons. Non- 
LTR retrotransposons are divided into LINEs (long interspersed 
nuclear elements), SINEs (short interspersed nuclear elements), 
and SVA (SINE, VNTR, and Alu). LINE-1 elements have been 
described in fungi, and several families called Zorro elements 
have been detected from C. albicans. They are all members of 
LI clades. Through a mechanism named target-primed reverse 
transcription (TPRT), LINEs translocate the new copy into the 
target site to initiate DNA synthesis primed by the 3' OH of 
the broken strand. In this article, we describe some advances 
in the research on structural features and origin of non-LTR 
retrotransposons in C. albicans, and discuss mechanisms 
underlying their reverse transcription and integration of the 
donor copy into the target site. 



Introduction 

Candida albicans is a major human fungal pathogen. With the 
spread of AIDS and the increased use of invasive surgical tech- 
niques, C. albicans infections have become more of a problem in 
recent years. 1 C. albicans is an asexual eukaryote. However, Sac- 
charomyces cerevisiae can also reproduce sexually. 2 Several labora- 
tories have devoted considerable efforts over recent years toward 
understanding the genomic organization of C. albicans and 
how it varies among strains. Several results to date include the 
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construction of a Sfil restriction map of the complete genome 3 
and a detailed physical map of chromosome 7. 4 

C. albicans is an important model system for studying patho- 
genic fungi and interactions between these species and their 
hosts. Several researchers 5,6 reported the existence of a large 
number of families of retrotransposons in C. albicans, Ret- 
rotransposons should be are transcribed into mRNA molecules 
and then be reverse transcribed into double stranded cDNA by 
their own reverse transcriptase before the potential mobility of 
retrotransposons can be approximately predicted by the pres- 
ence of their mRNA transcript. 7 Retrotransposons are a signifi- 
cant component of many eukaryote genomes; for instance, LI 
retrotransposon comprises 15% of the human genome, 8 and is 
known to cause mutations and promote genomic alterations. 9 It 
is widespread in multicellular eukaryotes, and has an important 
effect on the structure of eukaryotic genomic and genetic evolu- 
tion. Two types of transposons have been classified: transposons 
that encode a transposase required for transposition, and retro- 
posons that use a retrotranscriptase encoded in their genome for 
retrotransposition. Transposons are found in a large variety of 
eukaryotes, and retrotransposons are part of different subfamilies 
of transposons. It is remarkable that retrotransposons are highly 
related to animal retroviruses with respect to gene organization 
and expression strategies. 10,11 

Retrotransposons are divided into two major categories. First, 
long-terminal-repeat (LTR) retrotransposons have structures and 
mechanisms similar to those of vertebrate retroviruses. The inte- 
grated forms of LTR retrotransposons are flanked by LTR at the 
end of both sides. Second, non-long-terminal repeat (non-LTR) 
retrotransposons that move by target-primed reverse transcription 
(TPRT), which emerged from the biochemical work of Luan and 
Eickbush 12 using the R2Bm model of Bombyx mori lacking the 
LTR retrotransposons characteristic of retroviruses and retrovi- 
ral-like transposons. Non-LTR retrotransposons are divided into 
LINEs (long interspersed nuclear elements), SINEs (short inter- 
spersed nuclear elements), and SVA (SINE, VNTR, and Alu). 
Non-LTR retrotransposons also contain a reverse transcriptase 
domain. Unlike LTR retrotransposons, they have no LTR ret- 
rotransposons, either direct or indirect. This review summarizes 



www.landesbioscience.com 



Virulence 



245 



The structure of LINEs 



Emm orfi 



Earns 



endo 



RT 



3 The structure of Zorro3 in C.albicans 



A n 



endo 



RT 



Q The structure of Zorrol in C.albicans 



cannot be 
identified* 



kyjli;| 0RF1 



zf zf 



Figure 1. Structure of non-LTR retrotransposons. (A) Structure of LINEs: LINEs family consists 
of two open reading frames, ORFI and ORF2. ORF1 encodes a RNA-binding protein that asso- 
ciates with the LINE transposition intermediate. ORF2 encodes endonuclease (endo), reverse 
transcriptase (RT), zinc finger domain (zf), and RNase H domains in some cases (not shown). 
Arrows are TSDs. A represents poly-A tail. (B) Structure of Zorro3 in C. albicans: ORFI contains 
two zinc knuckle (zk) motifs called type I ORF1, while human Lis contains a type II ORF1. Zorro3 
has no TSDs, with poly-A tract flanking both ends. (C) Structure of Zorrol in C. albicans. The end 
of 5'UTR cannot be identified. Unlike another non-LTR retrotransposons, neither a poly-A tract 
nor a 3' tandem repeat is apparent at the 3' end of Zorrol . 



the past and recent advances in the study of non-retrotransposon 
elements in C. albicans. Further delineation and comparison of 
non-LTR retrotransposons in C. albicans may provide interest- 
ing insights into more general aspects of the genome structure, 
function, and mechanism, though the integrated structure and 
mechanism remain unclear. 

LINEs Elements Found in C. albicans 

As we described above, LINEs (long interspersed nuclear ele- 
ments) are one of the three classes of non-LTR retrotransposons 
that influence the evolution of eukaryote genomes. 13 Complete 
mechanistic details of how LINEs duplicate and retrotranspose 
are unclear; however, a mechanism of the reverse transcription, 
termed target-primed reverse transcription (TPRT), has been 
reported. 12 In history, these elements which are called LINEs in 
generally today have been referred to by a variety of names, includ- 
ing poly A retrotransposons, nonviral retroposons, or simple ret- 
roposons. The first indication is that these elements were cata- 
lyzed by the retrotransposition machinery of LTR retrotranspo- 
sons or retroviruses. 14 The rapid accumulation of more sequences 
eventually leads to the recovery of elements from different ani- 
mals and plants with ORFs that encode intact RT domains. 
Phylogenetic comparison of these RT sequences with that of all 
other RT sequences revealed that they represented a distinct class 
of retrotransposons. 1516 It soon became known that RT domains 
of several elements could encode authentic RT DNA polymerase 
activity. 17 " 19 These elements are called LINEs retrotransposons 
today. The structure of LINEs is shown in Figure 1A. LINEs 
are 4-6 kbp in length and bounded by an untranslated region 
(UTR) at both ends of the element. 20 LINEs are characterized 
by 3' poly-A tails or 3' tandem repeats as other non-retrotrans- 
posons and transcribed from a promoter within the first few 



nucleotides of the element. Active LINEs fre- 
quently result in 5' truncated LINE copies. 21 
Most LINE elements are inactivated because 
of inefficiency of reverse transcription that 
is error-prone, so that ORFs encoding the 
transposition machinery are likely to be dis- 
abled by mutations, and not highly proces- 
sive, so that 5' truncation of the elements 
often occurs during transposition. A typical 
LINEs family consists of two open reading 
frames, ORFI and ORF2 (Fig. 1A). ORFI 
encodes a RNA-binding protein that associ- 
ates with the LINEs transposition interme- 
diate and nucleic acid chaperone activity, 22 " 25 
both of which are important for LINEs 
activity. 26,27 ORF2 encodes endonuclease, 28 
reverse transcriptase activity, 2 '' 1 zinc finger 
domain, and RNase H domains in some 
cases. Genomic LINEs, like human LI, are 
typically flanked by target site duplications 
(TSDs) as LTR-retrotransposons. ORFI and 
ORF2 proteins assemble with LINEs RNA 
into a ribonucleoprotein (RNP) complex, 30 
which is presumably transported into the nucleus. 31,32 

Multiple retrotransposons, consisting of non-LTR retrotrans- 
posons and LTR-retrotransposons, are flanked by 4-5 bp short 
direct repeats representing TSDs at 5' and 3' ends. For instance, 
36% of the total S. cerevisiae Ty 1-4 elements were flanked by 
TSDs, 33 and it is reported that Tea elements are also typically 
flanked by TSDs. 5 Analyzing the sequences of all the perfect 
TSDs of Tea elements in C. albicans 5 (Fig. 2A) and Ty elements 
in S. cerevisia^ (Fig. 2B) to derive a 4-5 bp TSDs target site 
sequence, a strong bias for A and T: in the internal position 2 
(72%), position 3 (76%), position 4 (78%), is shown in Fig- 
ure 2B. In Figure 2A, a bias for A and G is found in position 1 
(92%), a bias for T and C is shown in position 4 (71%). Recom- 
bination or mutation may result in the exchange of target site 
sequences between the elements. 

Many non-retrotransposons have been found in vertebrates, 
insects, and fungi. Human LI element has affected both the size 
and complexity of the human genome, 34 and varietal plant non- 
LTR retrotransposons have been reported, for instance, Cin4 
in maize 35 and BLIN (6.3 kbp in length) from barley. 36 So far, 
phylogenetic analysis of non-LTR retrotransposons based on 
the reverse transcriptase domains has allowed for distinguish- 
ing 21 clades. 37 " 43 Three clades (Tad, LI, and CRE) of non- 
LTR retrotransposons are known in fungi. LI clade elements 
were described from the genomes of C. albicans? 11 a basidio- 
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pora. 46 Unfortunately, S. cerevisiae appears to lack non-LTR 
retrotransposons. 33 

The existence of non-LTR retrotransposons in C. albicans has 
been reported. 4,5 Subsequently, Goodwin et al. 6 used a series of 
TBLASTN (protein query vs. nucleotide database) and BLASTN 
search 47 to screen non-LTR retrotransposons in assembling 5 of 
the Stanford C. albicans sequence database, and identified only 
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Figure 2. TSDs target site sequence flanking by retrotransposons. The direction of TSDs is 5' to 3'. (A) Four base pairs TSDs target site sequence ana- 
lyzed by Tea families in Candida albicans. Sample capacity, n = 24. (B) Five base pairs TSDs target site sequence analyzed by Ty families in Saccharomyces 
cerevisiae. Sample capacity, n = 118. 



a small number of sequences corresponding to non-LTR ret- 
rotransposons. Only three of them appear to be full-length or 
nearly full-length: Zorrol, Zorro2, and Zorro3 with 25—40% 
amino acid identity. 6 Zorro elements are widespread in C. albi- 
cans giving low copy numbers (data not shown by the original 
authors). 6 

The structures of the Zorro elements are shown in Figure IB 
and C. The structure of Zorro2 is similar to that of Zorrol, 
except that the ORFs have suffered several nonsense frameshift 
mutations and highly conserved residues can be identified. The 
intact Zorrol element (Fig. 1C) contains two ORFs, like many 
non-LTR retrotransposons. ORF1 containing two zinc-finger 
motifs potentially considered as putative nucleic acid-binding 
domains. ORF2 encodes a potential endonuclease (EN), a reverse 
transcriptase (RT), and a C-terminal. Upstream of ORF1 is a 5' 
untranslated region (5'UTR), and the end of 5'UTR cannot be 
identified. Comprising with 5'UTR, downstream of ORF2 is a 
3' untranslated region (3'UTR). The end of this 3'UTR can be 
tentatively identified; however, neither a poly-A tract nor a 3' 



tandem repeat is apparent at the 3' end of Zorrol. The Zorro3 
element is a structurally intact element. 48,49 It contains ORF1 
and ORF2, the first of which encodes two zinc-finger motifs 
(considered as putative nucleic acid-binding domains). ORF2 
of Zorro3 encodes an endonuclease (EN), a reverse transcrip- 
tase (RT), and a C-terminal. Zorro3 is bounded by 5'UTR at 
upstream of ORF1 and 3'UTR at downstream of ORF2. The 
end of 5'UTR of Zorro3 is characterized by a series of A resi- 
dues, and the end of 3'UTR can be identified as a short poly-A 
tract (itself bordered by poly-A). Interestingly, ORF2 of Zorro3 
is separated from a feature-like stop codon that contains four 
in-phase stop codons. But it was reported 50 that the gag and pol 
ORFs were separated by a UGA stop codon (gag-UGA-pol junc- 
tion) in the C. albicans retrotransposon Tca2. Forbes and Gibson 
et al. 50 demonstrated that the LTR promoter directed Tca2 pol 
protein expression and suggested that there was a non-canonical 
mechanism underlying gag UGA bypass in Tca2. Unfortunately, 
whether or not Zorro3's ORF2 directly translates stop codon 
bypass remains unclear. 
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Figure 3. The neighbor-joining (NJ) phylogenetic tree based on RT amino acid sequences of 
LI elements from fungi. The percentage of bootstrap support for major branches is indicated. 
The clade and families are shown on the right. The distance is the categories distance of the 
PROTDIST program of PHYLIP. 62 
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Figure 4. The neighbor-joining (NJ) phylogenetic tree based on RT sequences of Zorro ele- 
ments from C. albicans. The percentage of bootstrap support for major branches is indicated. 
The clade and families are shown on the right. The distance is the categories distance of the 
PROTDIST program of PHYLIP. 62 



Ll-like non-LTR retrotransposons were described for all 
eukaryotic groups: Protista, Plantae, Fungi, and Metazoa. 36,43,51 ' 52 
The neighbor-joining (NJ) phylogenetic tree based on reverse 
transcriptase of non-LTR retrotransposons reveals the position 
of the Zorro elements in LI non-LTR retrotransposons. Figure 3 
shows that the phylogenetic tree in distinct families is inside 
LI clade based on RT domain. Subsequently, three Zorro ele- 
ments emerge as a monophyletic group shown in Figure 4. These 
assignments are well supported by bootstrap re-sampling. It is 
remarkable that the three families of Zorro elements have been 
evolving independently for a very long time, and that they are 
probably extremely ancient components of the Candida genome. 

As we described below, Zorro elements in C. albicans are intact 
elements consisting of two ORFs, and ORF2 encodes an endonu- 
clease (EN), a reverse transcriptase (RT), and a C-terminal. An 
UTR is bounded at both ends of Zorro elements in C. albicans. 
However, comparing with another LI non-LTR retrotranspo- 
sons, for instance, human LI, which is a classical structure, these 
are series of differences between Zorro elements and another LI 
non-LTR retrotransposons (Fig. 1). Unlike human LI elemnets, 



neither a poly-A tract nor a 3' tandem repeat 
is apparent at the 3' end of this copy of Zorro 1 
and Zorro2. However, Zorro3 has a short poly- 
A tract at the end of 3'UTR. Another distin- 
guishing feature between Zorro3 and human 
LI elements is that Zorro elements contain 
two zinc-finger motifs in ORF1 instead of the 
conserved mammalian C-terminal domain. 
ORF1 contains two zinc knuckle motifs called 
type I ORF1, while human Lis contains a type 
II ORF1. 48 Another distinguishing feature is a 
19-bp poly-A tract in the inter ORF region. 

Translocation of LINEs Using 
Target-Primed Reverse Transcription 



The process of how LINE elements ret- 
rotranspose is called target-primed reverse 
transcription (TPRT), which is a mode of 
duplication and transposition of non-LTR 
retrotransposons by spreading through reverse 
transcription of retrotransposon RNA primed 
by DNA at the target site. By extension, it is 
likely that this mechanism applies to numer- 
ous LINEs found in diverse lineages, like 
human LI. 53 At first, a RNA binding protein 
with endonuclease (EN) activity encoded by 
ORF1, a multifunctional protein with reverse 
transcriptase (RT) activity encoded by ORF2, 
and the LI RNA transcribed from its inter- 
nal RNA polymerase II promoter located 
within the 5'UTR, 54 to compose a com- 
pound called LI ribonucleoprotein particle 
(RNP). 21 ' 55 RNP enters the nucleus and nicks 
a chromosomal target site for integration. The 
sequence of events in translocation is shown 
in Figure 5. Recombination begins with nicking of the target 
DNA by the element-encoded EN that preferentially cleaves A/T 
rich sequences, with nicking occurring mainly at the TpA and 
flanking phosphodiesters. The target DNA 3' OH exposed by 
endonuclease cleavage then acts as a primer for the synthesis of 
a new line DNA strand by reverse transcriptase using the line 
mRNA as a template. 56 Thus a new line DNA strand is produced 
at the insertion site. And then, the nuclease makes a break in the 
opposite strand of chromosomal DNA a few nucleotides from the 
first. Template RNA is removed by RNase H allowing the new 
3' OH to prime synthesis of the second DNA strand and host 
repair enzymes to complete integration. Finally, a second DNA 
strand is synthesized, and the target DNA at each end is filled in 
to generate the TSDs. 57 " 58 In addition, TPRT is mediated by the 
activities of both EN and RT domains; however, whether EN 
and RT are competitive inhibitors or non-competitive inhibitors 
remains unclear. 26,59 In RNP, the poly G RNA could inhibit LI 
EN activity. By DNA binding or action of LI ORF1, 22 ' 60 the poly 
G RNA may be removed from the LI EN domain, and LI EN 
activates to nick the chromosome at the target site. The nicked 
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Figure 5. The mechanism of target-primed reverse transcription (TPRT). 
Transposition begins with the transcription of the LINE element (red) 
into RNA (blue) which encodes an RNA binding protein and a multi- 
functional protein with endonuclease and reverse transcriptase activity. 
These proteins (not shown) associate with the LINE RNA, and the endo- 
nuclease nicks the DNA at the target site, which contains a poly T tract, 
which base-pairs with the poly A sequence in the LINE RNA. The LINE 
RNA is then copied by the reverse transcriptase into a DNA copy (green), 
which is covalently attached to the target DNA. A second DNA strand is 
then synthesized on the template of the DNA copy, and the target DNA 
at each end is filled in to generate the TSDs that flankthese elements. 



DNA moves to the RT active site and the newly generated 3' OH 
primes reverse transcription and double-strand breaks (DSBs) 
generated in trans. 

Goodwin et al. 49 developed a yeast model system using the 
Zorro3 element from C. albicans for the study of non-LTR 
retrotransposons. This system called retrotransposition assay 
for Zorro3 is outlined in Figure 6, in which the ORF of the 
C. albicans URA3 gene and its promoter sequence, with the 
ORF disrupted by an antisense intron inserted into 3'UTR of 
Zorro3 element, as the indicator gene. When Zorro3 is tran- 
scribing to give a full-length mRNA, and then the intron would 
be removed by splicing. Thus, retrotransposition events can be 
detected by the appearance of URA3 + colonies on the appro- 
priate selective media. After retrotransposition assay, 30 inde- 
pendent transposed copies were amplified to reveal not only 
the 3' and 5' ends but their 3' and 5' flanking sequences of 
retrotransposed Zorro3 elements. Several findings from these 
sequences indicate that the target site of Zorro3 elements which 
is inserted very close to coding regions specifically integrated 
at poly-A sequences, and there seemed to be a bias toward pro- 
moter regions. In addition, Goodwin et al. suggested that the 
transposable events in Zorro3 of C. albicans are similar to TPRT 
in mammalian cells. 

As we described above, non-LTR retrotransposons have never 
been found in S. cerevisiae that has no endogenous LI homologs 
or remnants. However, Poulter et al. 61 established a model sys- 
tem of S. cerevisiae called retrotransposition assay for scZorro3 
(Zorro3 in S. cerevisiae named ScZorro3), which has a similar 
process of retrotransposition assay for Zorro3 in C. albicans 
except using mHIS3AI as an indicator gene to confirm Zorro3 
retrotransposition, and found that S. cerevisiae unexpectedly 
retained the basal host machinery required for LI retrotrans- 
position. Through this model system called scZorro3 that reca- 
pitulates the non-LTR retrotransposition process in S. cerevisiae, 
they found several differences between Zorro3 of C. albicans 
retrotransposition and scZorro3 of S. cerevisiae retrotransposi- 
tion. For instance, the reverse transcription complex searches for 
sequences with homology to the minus strand to enable the tem- 
plate to jump during minus strand synthesis. In Zorro3 of C. albi- 
cans retrotransposition, this search is largely restricted to regions 
around the target site. In scZorro3 of S. cerevisiae retrotransposi- 
tion, this search space is relaxed, and template jumping occurs in 
other RNAs/DNAs at a higher frequency. In addition, scZorro3 
can generate a circular and episomal retrotransposition products 
in S. cerevisiae. 62 Previously, circular products derived from ret- 
roviruses or LTR-retrotransposons were observed. 63 " 65 Han and 
Shao suggested that these products are likely to be formed via a 
variation of TPRT. 63 For simplicity, bottom chromosome strand 
nick at first, and then LINE mRNA annealing, minus strand 
synthesis finally. Subsequently, the top strand chromosomal nick 
and the template jump to the top strand and then re-cleave the 
top and bottom strands to release the retrotransposition inter- 
mediate. These episomal products may represent an unexpected 
source for de novo retrotransposition. Yeast model systems of C. 
albicans and S. cerevisiae have been principally described, which 
have been developed for studying Zorro family elements and 



TPRT emerged by biochemical experiments with human LI and 
Zorro3 retrotransposon. However, complete mechanistic details 
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Figure 6. An assay for Zorro3 retrotransposition. The cloned Zorro3 element has a retrotransposi- 
tion indicator gene (URA3 promoter, and URA3 ORF, disrupted by an antisense intron) inserted 
into its 3' UTR. Reverse transcription and integration of the spliced RNA results in a functional and 
stably integrated URA3 gene and confers a URA3 + phenotype on the host cell. 



of how Zorro families of LINE elements retrotranspose remain 
unclear. 

TPRT is a process spreading through reverse transcription of 
retrotransposon RNA primed by DNA, effectively welding the 
new copy into the target site as it is made. It is a complicate pro- 
cess that LTR retrotransposons can move from place to place in a 
genome by reverse transcription of an RNA transposition medi- 
ated in cells (in this study, we do not describe in details). 66 Dis- 
tinguishing features of TPRT (as compared with the process that 
LTR retrotransposons transpose) are the RNP, consisting of LI 
RNA, proteins encoded by ORF1 and ORF2, enters the nucleus 
and nicks a chromosomal target site as the first step; however, 
no compound similar to RNP have been found in the process 
that LTR retrotransposons transpose. The target DNA 3' OH 
acts as a primer for the synthesis of a new line DNA strand in 
TPRT, whereas a tRNA base-paired to a sequence near 5' end of 
the genomic RNA, as a primer to anneals to binding site on ret- 
roviral RNA for the synthesis of minus strand DNA; 9 retroviral 
RNA ends in direct repeats (R), and results that a linear double- 
stranded DNA with an LTR at each end. 

Non-LTR Retrotransposons Play an Important Role 
in Evolutionary Dynamics of C. albicans 

The evolutionary history of a particular or related species, the 
population structure, ecological aspects, and the mating mode 
could affect the diversity of non-LTR retrotransposons and copy 
numbers. 67,68 For instance, LI elements play an important role 
in the evolution of the structure and activity of the remainder of 
the genome by providing dispersed sites of sequence similarity 
at which recombination can occur, by inserting into genes alter- 
ing their structure and/or regulation, and by carrying flanking 
sequences with them during transposition (Ll-mediated sequence 
transduction). 65 In addition, there are other processes that could 
affect the copy number and diversity of non-LTR retrotranspo- 
sons in fungi: stochastic loss of non-LTR retrotransposons, burst 



of retrotransposition, the limitation of copy 
number increase by natural selection which 
removes deleterious insertions, horizontal 
transfer, passive and active inactivation of 
repetitive sequences, and self-regulation 
of transposition. 67 ' 70,71 Low copy numbers 
of non-LTR retrotransposons could cause 
a loss of retrotransposons-like elements as 
a result of genetic drift, especially when 
the population is small and non-LTR ret- 
rotransposons degenerate copies. 72 It is 
reported that the presence of retrotranspo- 
sons and their large copy numbers can cause 
mutations and genomic rearrangements. 
These discoveries indicate that non-LTR 
retrotransposons and the transposition play 
an important role in evolutionary dynamics 
of C. albicans. 

The inactivation of repeated sequences is 
a very important factor, which leads to the 
shifts in diversity and copy number of non-LTR retrotranspo- 
sons. For instance, non-LTR retrotransoposons represented only 
by degenerate copies in Drosophila could lose these elements as 
a result of genetic drift, especially if the population is small. 72 
In bacteria, Tn retrotransposons are likely to be principal play- 
ers in the formation of tetracycline resistance by spreading drug 
resistance gene during genetic transfer. 73 In addition, the rela- 
tionship between resistance and virulence with reverse transpo- 
sition of retrotransposons is rarely reported, but in our original 
research, the transposition of Zorro2 and Zorro3 in strains that 
are resistant to miconazole and the strains show low virulence 
in a systemic murine candidiasis model, have been observed 
(unpublished). 

SINEs and SVA Elements are Rarely Reported 
in C. albicans 

We have summarized several past and recent advances in the 
study of LINEs including Zorro families in C. albicans. Unfor- 
tunately, little has been known about the distribution and prop- 
erties of SINES and SVA elements in C. albicans as compared 
with LINEs elements. However, much has been disclosed about 
the biology and function of SINEs and SVA elements since these 
elements were discovered. 

SINEs are genomic sequences derived from tRNA genes or 
7SL RNA, and they spread non-autonomously in the genome 
by TPRT mediated by LINE-encoded recombination proteins. 
The first described SINEs were mouse Bl and B2 74,75 and human 
Alu. 76 Today these elements are also found existing in other 
organisms, including fungi, insects, birds, and plants. SINEs 
are similar to LINEs in that both move via TPRT. 77 SINE ele- 
ments are much shorter (100-300 bp) than LINEs. A typical 
SINE consists of three parts: 78 5' ends of all SINEs families origi- 
nating from one of the three types of short pol III transcripts: 
tRNAs, 5S rRNA, or 7SL RNA. The 3' ends consist of poly A 
tails flanked by TSDs. The internal domain of the SINEs family 
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is usually unique and has no coding capacity. To date, 4 such 
domains have been described: CORE domain in vertebrates, 79 
V-domain in fish, 80 Deu-domain in deuterostomes, 81 and Ceph- 
domain in cephalopods. 82 

SVA elements for another group of non-autonomous retro- 
elements in humans and non-human primate, and are present 
at a relatively low copy number of a few thousand per genome. 
The SVA elements were originally named SINE-R. 83 It is named 
"SVA" after its main components (SINE, VNTR, and Alu) 
by Shen et al., 84 who identified the SINE-R element together 
with a stretch of sequence that shares sequence similarity with 
Alu sequences. The 3' ends of full-length SVA have the human 
endogenous retrovirus HERV-K, including the LTR and a 3' 
poly A tails, and TSDs flanking both ends of SVA elements. A 



(CCCTCT)n hexamer simple repeat region that is located at the 
5' end. The internal domain is composed of anyl/w-like sequence, 
a VNTR (variable number of tandem repeats) region, and a SINE 
region (SINE-R) about 490 bp. It is proposed that SVA elements 
are non-autonomous retrotransposons that are mobilized by LI 
encoded proteins in trans2. 85 
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